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REPROGRAMMABLE DIGITAL WIRELESS COMMUNICATION DEVICE 
AND METHOD OF OPERATING SAME 

This application claims priority to the provisional patent applications with the 
following Serial Numbers: 60/133,141; 60/133,137; 60/133,129; and 60/133,135, each 
of which was filed on May 7, 1999. 

5 BRIEF DESCRIPTION OF THE INVENTION 

This invention relates generally to wireless communication devices. More 
particiilarly, this invention relates to a reconfigurable digital wireless communication 
device. 

10 BACKGROUND OF THE INVENTION 

Existing communication devices are "static" devices. Thai is, they are 
designed to support a specific wireless communication standard and/or to support a 
specific application (e.g., voice, data transmission) at a specific data rate. Typically, 
different wireless communication standards are used in different wireless networks, 
15 both within a geographic locality and worldwide. Thus, an individual traveling 

between different regions is required to use a separate wireless communication device 
in each region. 

In addition, the advent of new and evolving user applications and services may 
necessitate redesign of static wireless conmiunication infrastructure and terminals. 
20 Thus, an individual or service provider who wants to utilize or enable such services is 
required to replace or upgrade equipment. 

1 
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Figure 1 illustrates a digital commimication modem 20 that may be 
implemented in accordance with prior art approaches to multi-standard conrunimication 
devices. A transmitter 22 transmits a signal to a channel 24. which may be a wireless 
or physical channel. The transmitted signal is received at the modem 20, where it is 

5 initially processed by a Radio Frequency (RF) subsystem 26. The RF subsystem 26 
performs analog mixing, analog filtering, and analog gain control functions. The 
analog signal from the RF subsystem 26 is then converted to an equivalent digital 
signal by an analog-to-digital converter 28. 

The digital signal is then processed by a digital front-end processing circuit 30, 

iO which performs standard-specific, channel-specific, and modulation-specific 

bandwidth selection, filtering, sampling-rate control and other signal processing. The 
signal from the digital front-end processing circuit 30 is then passed to,a 
detector/demodulator circuit 32. which performs signal detection and demodulation 
operations. The detection and demodulation circuit 32 also interacts with a parameter 

15 estimation circuit 34. The output from the detection and demodulation circuit 32 is 
subsequently processed by a channel decoder 36 and then a source decoder 38. 

Figure 2A shows a prior art architecture for implementing the structure of 
Figure 1 . An RF subsystem 26 is again used to provide one or more bandpass signals 
(intemiediate frequency signals), which are then digitized by the analog-to-digital 

20 converter 28, which is typically implemented as a free-running analog-to-digital 
converter. The output from the analog-to-digital convener 28 is placed on a bus 52. 
Signals from the bus 52 are routed lo hardware processors 60 and software 
programmable processors 70. The hardware processors include programmable logic 
device 62 and fixed-function logic 64. The software programmable processors include 

25 digital signal processor 72 and microprocessor 74. 

The digital components (30. 32. 34. 36. and 38) of the device 20 are typically 
implemented on software-programmable processors 70. or as a ftilly hardwired, non- 
programmable application-specific integrated circuit 64. The hardwired circuit may be 
augmented by a programmable logic device 62. which provides limited fme- 

30 granularity programmability. The device 50 of Figure 2A facilitates the management 
of ftmctions executing on the programmable logic device 62 via controlling the 
download of ftinctionality into the programmable logic device 62 and control of the 

0 
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dataflow into and out of the programmable logic device 62. The software- 
programmable processors 70 typically comprises a digital signal processor 72 and a 
control microprocessor 74. For lower bandwidth applications (tens of kbps), software- 
programmable digital signal processors 72 are typically used to perform requisite 
5 signal processing functions. For high bandwidth applications (tens of Mbps), a fully 
hardwired approach is typically employed. The general purpose microprocessor 74 
typically performs control and other functions. Accordingly, the signal processing 
device 50 can be highly optimized only for a particular communication standard, 
service and application. Prior art approaches to accommodating multiple standards, 
10 services and applications have essentially consisted of combining the disparate 

hardware and software resources separately optimized for each service of interest. This 
results in poor efficiency in terms of size, weight and power consumption. 

Figure 2B illustrates the control architecture 80 for a prior art multi*standard 
communication apparatus. The architecture 80 includes executive code 88, which is 
15 effectively an operating system running on microprocessor 74 or digital signal 

processor 72. One of a suite of applications 84 is selected to run under the operating 
system. Each application 84 executes a set of software/hardware functions 82A-C. 
Each application 84 requires computation resources, which are available according to 
Figure 2A. either via a microprocessor, a digital signal processor, a fixed-function 
20 logic engine, or a programmable logic engine. Thus, each of the applications requires 
some combination of these resources: the actual panitioning among their use is 
determined by the product/application requirements. For example, operation on a 
portable device favors much of the functionality being implemented on dedicated, 
fixed-function hardwired logic devices. On the other hand, product flexibility and 
25 upgrade-ability requires use of completely programmable components in the platform, 
such as microprocessors, digital signal processors, and programmable logic. The 
approach of Figure 2B is also inefficient, as it essentially relies upon sequentially 
selecting one of the sets of disparate and redundant hardware and software resources 
discussed in connection with Figure 2A. 
30 The poor efficiency of the prior art approaches discussed is evident from 

Figure 3, which depicts energy efficiency vs. flexibility for the various architectural 
elements of device 50. The highly efficient fixed-hardware resources are highly 
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inflexible, so tiiat considerable replication (and therefore redundancy) is incurred. At 
the opposite extreme, the highly flexible programmable logic devices, embedded 
processors and software-programmable digital signal processors (DSPs) that may be 
employed are inefficient with respect to power and size. These design tradeoffs result 
5 in the Energy-Efficiency Gap 92 shown in Figure 3. 

In view of the foregoing, it would be highly desirable to provide a single 
wireless communication device that can efficiently and cost-effectively support 
multiple wireless communication standards and applications. 



10 SUMMARY OF THE INVENTION 

A digital wireless communication device comprises a software-progranmiable 
processor, a heterogeneous reconflgurable muhiprocessing logic circuit, and a bus 
connecting the software-programmable processor and the heterogeneous 
reconflgurable multiprocessing logic circuit. The software-programmable processor is 

15 selected from the group comprising: a digital signal processor and a central processing 
unit. The heterogeneous reconflgurable multiprocessor comprises a set of 
heterogeneous signal processing kernel s and a reconfipurable data route r 
interconnecting the heterogeneous signal processing kern els. The signal processing 
kernels and data router are controlled by the software-programmable processor via 

20 control busses. The design of the heterogeneous reconflgurable multiprocessor is 
aided by an analysis method referred to as profiling. 

The invention establishes a new architecture for multiple-service, multiple- 
standard digital communication devices. The platform enables the same hardware 
resources to be reconfigured in order to provide more flexible delivery of arithmetic 

25 and control operations via techniques of heterogeneous reconflgurable 

multiprocessing. The architecture is reprogrammable through the control of software 
resident on or downloaded into memory, which may be used by the microprocessor, 
digital signal processor, or heterogeneous reconflgurable muhiprocessing logic. The 
architecture facilitates new. broadband services over a wireless or wired 

30 communication network on a single hardware platform at the base station 
(infrastructure end) or the user terminal (consumer terminal end). 
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The invention provides a communication apparatus that can operate over a 
plurality of telecommunication physical layer standards, radio frequency bands, data- 
rates, and user-programmed or network-programmed services. The functionality of a 
device constructed in accordance with the invention is defmed as a set of software 

5 modules, each including a combination of "personalities" for the hardware devices of 
the apparatus. The software modules comprise object code that executes on a 
microprocessor, a digital signal processor, or on a heterogeneous reconfigurable 
multiprocessing logic module. The invention enables the use of a single hardware 
platform to operate over a variety of bands, standards, and data rates, while delivering 

10 a variety of user-programmed or network-programmed services. Furthermore, the 
invention can be reconflgiured in the factory or field through various means including 
but not limited to factory/point-of-sale programming, remote control, and over-the-air 
or over-the-network download. 

Thus, the invention significantly reduces the number of product platforms 

1 5 required to span a large product offering space for the wired or wireless 
communication market. 

BRIEF DESCRIPTION OF THE DRAWINGS 

For a better understanding of the invention, reference should be made to the 
20 following detailed description taken in conjunction with the accompanying drawings, 
in which: 

FIGURE 1 illustrates a digital communication modem that may be 
implemented in accordance with the prior art. 

FIGURE 2A illustrates an architecture used to implement a prior art digital 
25 communication modem. 

FIGURE 2B illustrates a control strategy used to implement a prior art digital 
communication modem. 

FIGURE 3 illustrates the Energy-flexibility Gap associated with the prior art. 
FIGURE 4A illustrates a reconfigurable digital wireless communication device 
30 in accordance with an embodiment of the invention. 
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FIGURE 4B illustrates a reconfiguration/'controi methodology used in 
conjunction with a reconfigurable digital wireless communication device of the 
invention. 

FIGURE 5 illustrates the energy-flexibility characteristic of heterogeneous 
S reconfigurable multiprocessing hardware constructed in accordance with the invention. 
FIGURE 6 illustrates the application of heterogeneous reconfigurable 
multiprocessing hardware of the invention to a communication system. 

FIGURE 7 illustrates a heterogeneous reconfigurable multiprocessing 
hardware architecture in accordance with an embodiment of the invention. 
10 FIGURE 8 illustrates a detailed embodiment of the architecture of Figure 7. 

FIGURE 9A illustrates a Reconfigurable Data Processing Kemel in accordance 
with an embodiment of the invention. 

FIGURE 9B illustrates configurable architectures that may be fornied in 
accordance with an embodiment of the invention. 
15 FIGURE 10 illustrates profiling of computationally intensive fimctions in 

accordance with an embodiment of the invention. 

FIGURE 11 illustratesprofilingcommonality of fimctions across standards in 
accordance with an embodiment of the invention. 

FIGURE 12 illustrates the characterization of data processing kernels with 
20 variable and invariant components in accordance with an embodiment of the invention. 
FIGURE 13 illustrates profiling data flow among data processing kernels in 
accordance with an embodiment of the invention. 

FIGURE 14 illustrates the method of configuring one or more products in 
accordance with an embodiment of the invention. 
25 Like reference numerals refer to corresponding parts throughout the drawings. 

DETAILED DESCRIPTION OF THE INVENTION 
The efficiency of a multi-standard device may be dramatically improved via a 
new implementation architecture shown in Figure 4A. The new architecture also 
utilizes an RF subsystem 26 to provide one or more intermediate frequency signals. 
30 and an analog-to-digital converter 28. whose output is placed on bus 52. Signals fi-om 
bus 52 are routed to heterogeneous reconfigurable hardware multiprocessor 66 and 
software programmable processors 70. In this embodiment, processors 70 include 

6 
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DSP microprocessor 72 and control microprocessor 74. The new architecture exploits 
heterogeneous reconfigurable multiprocessor 66. which includes a pool of parallel 
heterogeneous hardware signal processors referred to as signal processing kernels. The 
kernels perform the more computationally intensive signal processing operations of a 

5 set of standards, applications and services of interest, and are selected and configured 
in a modular, non-redundant manner. The individual signal processing kernels and 
their interconnections can be quickly reconfigured, so that device 1 00 can quickly 
switch from one standard, application, and/or service of interest to another. DSP 72 
performs the less computationally intensive signal processing functions, while 

10 microprocessor 74 performs control and other functions. A communication bus 52 
links microprocessor 74, digital signal processor 72, and heterogeneous reconfigurable 
muhiprocessing hardware 66. Each hardware device is controlled by a corresponding 
software module 1 10. 

The commimication apparatus of the invention can change its function 

15 temporally via a mechanism that is under software control. Figure 4B illustrates the 
executive code 88 and software modules 1 10 of the invention operating in connection 
with the hardware platform 100 of Figure 4A. As shown in Figures 4A-B, selected 
software modules 1 10 execute on different hardware components. Thus, for example, 
software kernels 1 lOA-B are executed on heterogeneous reconfigurable 

20 multiprocessing hardware 66. software kernels 1 lOC-D are executed on the digital 
signal processor 72. and software kernels 1 lOE-F are executed on the microprocessor 
74. Executive code 88 further facilitates the management of the dataflow into and out 
of heterogeneous reconfigurable multiprocessor 66, as well as the functionality of 
heterogeneous reconfigurable multiprocessor 66. Thus, all the dataflow and control 

25 flow of all computational resources may be controlled and reconfigured via a software- 
programmable engine. 

The software modules 1 10 define a superset of signal processing and control 
fimctions required by the suite of standards, applications, and/or services of interest, in 
accordance with the matrix of Figiu^ 4B. In one embodiment, the software modules 

30 1 10 are platform*dependent. and are thus in the form of object code optimized for the 
platform imder consideration. They reside either in the product platform memory or 
are downloadable over-the-network, over-the-air. on the manufacturing floor, or point- 
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of-sale station. Typically, they are stored in a platform-dependent hierarchy matrix, 
wherein the hierarchy has the following form: 



10 



Product-Platform X 

Country of Operation 
Frequency Band 
Services 
Data-Rates 



♦♦specifies constituent processor cores, buses, 
etc. 

♦♦specifies allowed subset of applications 
♦♦specifies allowed fi-equency band of operations 
♦♦specifies allowed services in a country 
♦♦specifies data-rate support capability 



Layer 1 Functionality 

Object code to download for platform 
Layer 2 Functionality 



The apparatus of Figure 4A-B is preferably operated in the following marmer. 

15 The country of operation, allowed frequency band, allowed services, allowed data 
rates, etc. for a device are specified. Software modules to support these specifications 
are then selected. The software modules are then downloaded to the device of Figure 
4A. This may be accomplished over-the-air, over-the-network, on the manufacturing 
floor, or at a point-of-sale station. The software kemels are then tested on the device 

20 and the system is initialized for operation. 

The resultant architecture and control strategy effectively fills the energy- 
efficiency gap 92 as shown in Figure 5. Although the heterogeneous reconfigurable 
multiprocessor does not attain the efficiency of fixed-ftmction hardware due to the 
overiiead of resources used for reconfiguration (i.e.. to accommodate multiple 

25 services), this overiiead is minimal relative to that of the more programmable/flexible 
devices shown. Digital signal microprocessors and embedded processors are much 
less efBcient, due to the overhead of supporting instruction-set programmability. The 
efficiency of programmable logic devices is even lower, since they must employ a 
majorit>' of resources to permit general, fine-grained reconfiguration that exceeds the 

30 needs of the application space acconunodated by the invention. 

The criteria used to map the requisite signal processing functions into the 
architecture of Figure 4 A will vary, but will typically depend upon several key factors, 

8 
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including: power efficiency, memory requirements, throughput, latency, and 
flexibility. Function mapping will also rely upon product application space, multi- 
nation, multi-standard, or multi-service product operation requirements, product 
development time, and time-io-market requirements. Thus, the same hardware can be 
5 programmed and/or reconfigured to implement the same dataflow and control flow, 
but with a new set of parameters, that may be product quality-, channel-, standard-, 
service-, or country-of-operation-dependent. 

An example mapping of the Digital Front End Processor. Detector / 
Demodulator and Parameter Estimator of Figure 1 onto the architecture of Figure 4 A 
10 is given in Figure 6. This figure illustrates a digital communication modem 140 that 
may be implemented in accordance with an embodiment of the invention. A 
transmitter transmits a signal to a channel, which may be a wireless or. physical 
channel. The transmitted signal is received at the modem 140. where it is initially 
processed by a Radio Frequency (RF) subsystem 26. The RF subsystem 26 performs 
15 analog mixing, analog filtering, and analog gain control functions. The analog signal 
from the RF subsystem 26 is then converted to an equivalent digital signal with the 
analog-to-digital converter 28. 

The output from the RF subsystem 26 is one or more bandpass signals 
(intemiediate frequency signals), which are then digitized by the analog-to-digital 
20 converter 28, which is typically implemented as a free-running analog-to-digital 
converter. The output from the analog-to-digital converter 28 is placed on a bus 55, 
which may be reconfigurable. Signals from the bus 55 are routed to the hardware- 
reconfigurable digital front-end processor 142. Preferably, the reconfigurable 
processor 142 is implemented with a heterogeneous reconfigurable multiprocessor as 
25 discussed above. 

The reconfigurable digital front-end processor 142 performs channel selection, 
sample-rate conversion, digital down-conversion, and digital filtering. This is 
achieved through the use of multi-rate digital signal processing techniques, software- 
programmable filter coefficients, rate-conversion, chaimelization. and loop filter 
30 parameters. The output of reconfigurable processor 142 consists of complex IQ 
signals, which are then fed to a demodulation-detection-parameter estimation 
processor 144. The functionality of processor 144 is distributed between the software- 

9 
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programmable digital signal processor 72. the control microprocessor 74. and the 
hardware-reconfigurable co-processor 146. 

Processor set 144 perfonns demodulation/detection functions including but not 
limited to despreading, descrambling, dechannelizing, searching, and integrate-and- 

S dump detection. Processor 144 also perfonns parameter estimation functions 
including but not limited to channel estimation, correlation, pilot-signal searching, 
frequency-offset estimation, phase-offset estimation, and timing-error estimation. 
Through a profiling methodology discussed below, highly computation-intensive 
aspects of parameter estimation are allocated to the hardware-reconfigurable co- 

10 processor 146. while the balance of computations is allocated to the DSP 

microprocessor 72 and control microprocessor 74. For example, in a code-division- 
multiple-access (CDMA) application, high-speed chip-rate processing is performed by 
co-processor 146, while lower-speed channel estimation is performed by DSP 72. 
Co-processor 146 is architecturally distinct from processor 142 due to the 

IS autonomy of the respective signal processing functions, as well as differences in 

sample rates and other factors. The output from hardware-reconfigurable co-processor 
146 is fed to the bus SS« which interfaces with the digital signal processor 72, control 
microprocessor 74, and memory 76. Memory 76 includes segments for the executable 
code for DSP 72, control microprocessor 74, and for hardware and other configuration. 

20 The digital signal processor 72 and microprocessor 74 are completely sofbvare 

programmable, and offer the most flexibility with respect to which applications can be 
executed on the architecture. 

Processor 144 thus performs traditional inner receiver functions, which can be 
called via software routines from an instruction set processor. Thus, processor 144 is 

25 primarily software-programmable with requisite hardware reconfigurability, primarily 
with respect to dataflow between estimators and signal conditioning elements. 

Figure 7 illustrates an architecture 1 SO in accordance with an embodiment of 
the invention. The architecture 150 includes a Program Control Unit (PCU) 151 
which stores instructions and data. A heterogeneous reconfigurable multiprocessor 

30 160 performs computationally intensive signal/data processing functions, and is 

representative of either the hardware-reconfigurable digital front-end processor 142, or 
of the hardware-reconfigurable co-processor 146 of Figure 6. The heterogeneous 

10 
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reconfigurable muitiprocessor 160 is comprised of a set of heterogeneous data 
processing kernels 1 62 and a reconfigurable data router 168. The configuration of the 
heterogeneous data processing kernels 162 is determined by control bus 132, while the 
configuration of data router 168 is determined by control bus 154. 
5 The data flow and control flow computation kernels 1 62 are of varying 

granularit\\ from simple arithmetic operators, such as adders and multipliers, to more 
complex data flow/control flow operations, such as a complete Viterbi Algorithm Add- 
Compare Select and Fast Fourier Transform Butterfly units. The composition of 
computation kemels 162 is further discussed below. 
10 Figure 8 illustrates architecture 150 in greater detail. The PCU 151 receives a 

module of executive code 88 from microprocessor 74 of Figure 6. The executive code 
is a segment of the microprocessor executable programs stored in memory that 
orchesttates overall configuration and fiinctionality. A controller 156 configures a set 
of quasi-fixed-function logic kemels 162A-C. In a typical application, the executive 
1 5 code 88 executes on the control microprocessor 74 or DSP microprocessor 72, and the 
functionality of controller 156 is allocated to the microprocessors and associated 
peripherals such as memory and various bus interfaces. Figure 8 further illustrates that 
individual kemels may be interconnected either directly, as per representative path 
1 64, or via reconfigurable data router 1 68. Reconfigurable data router 1 68 ftirther 
20 receives input data from and delivers output data to bus 55. Reconfigurable data router 
168 is controlled by the data router manager 1 58 via control bus 1 54. and in turn via 
controller 1 56 and executive code 88 of Figure 4A. 

The portfolio of kemels 162 is determined by first profiling the applications of 
interesu as discussed below, Kemels 162 are compute engines, and their nature as x 
25 well as that of their interconnection is governed by any bit-slice, nibble-slice, and ) 
word-slice routing control mechanism, including, but not limited to. a programmable' 
bus. A preferred canonical architecture for the kemels 162 is depicted in Figure 9A. 
A configurable arithmetic logic unit (ALU) 186 performs the necessary signal 
processing operations. A local memory 182 serves as a high-speed cache. Data 
30 Sequencer 1 84 orchestrates the flow of data between local memor>' 1 82 and ALU 1 86. 
For further illustration. Figure 9B shows several representative or available 
configurable architectures that may be implemented by one or more kernels 162. 

11 
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Kernels 162 can be reconfigured via control lines 152 to determine what operations are 
possible. Similarly, the reconfigurable data router 168 of Figure 8 can be controlled to 
effectively re-order the sequence of signal processing operations performed by kernels 
162. 

5 Processor 160 can be reconfigured on a time-scale optimal for the application. 

For example, the mobile terminal of a user who roams from one geographic region to 
another having a different radio standard may be required to reconfigure over a period 
of seconds or minutes. On the other hand, the mobile terminal of a user who requests 
a real-time change of service, e.g., videoconference following from a voice coiuiection 

10 with a common party, may be required to reconfigure in milliseconds or faster. 

During execution, the architecture 160 operates as a heterogeneous (since the 
processors are not identical) multiprocessing (since there are multiple processing units) 
machine, with at least two concurrently operating processing units. Based on the 
selection of kernels 162, an augmented instruction set is defined for the heterogeneous 

IS reconfigurable multiprocessor 160. This augmented instruction set can be created, for 
example, by using a wide-word instruction by appending bits to an existing instruction 
word, with the new bit fields exclusively devoted to the decoding of instructions for 
the control and data flow for the heterogeneous reconfigurable multiprocessor. The 
instruction word, when decoded, feeds control units 156 and 158 of Figure 8. 

20 Controller 1 56 performs the role of taking the decoded instruction fields and 
configuring the kernels 162 and reconfigurable data router 168. via data router 
manager 158. The control of the reconfigurable data router 168 is effected via a 
control word, which, in a preferred implementation, is a bit field extracted from the 
instmction word. 

25 The heterogeneous reconfigurable multiprocessor 160 is designed according to 

a method referred to as profiling. Profiling includes the first step of surveying all 
signal processing and control functions required to accommodate the standards, 
applications, and/or services of interest. The most computationally intensive of these 
functions are then targeted to the heterogeneous reconfigurable multiprocessor 160, 

30 while the remaining functions are targeted to the DSP microprocessor 72. Typically, 
computational intensity is enumerated in units of millions of operations per second 
(MOPS). For example. Figure 10 depicts function 204 A which could be provided by 

12 
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the hardware reconfigurable processor 142. functions 204 B-E to be provided by 
coprocessor 146. and corresponding MOPS required by each function 204. These 
metrics are calculated for the various pertinent signal processing datapaths. 

Additionally in this step, computationally intensive functions are further 

5 categorized according to type of operation, e.g., arithmetic/logical, control, and 
memory access. For each category, characteristic power per MOPS is determined for 
hardware or software implementation from vendor data, analysis, or other means. 
Power, e.g., milliwans, required per function is thereby characterized for 
implementation in both reconfigurable hardware or in software (i.e.. running on a 

10 processor whose power-per-MOPS has been characterized). In addition, the 
corresponding code size (and therefore memory requirement) for software 
implementation is determined. From the above, and from budgeted power and 
memory resources, allocation of processing operations to hardware and software 
processors can be determined. 

IS The entries in spreadsheet 200 correspond to static operation for a particular 

standard, i.e., to a specific time within a dynamic operational scenario. The analysis of 
Figure 1 0 must be repeated as necessary to reflect important temporal variations 
during representative/ realistic scenarios for all standards, applications, and/or services 
of interest. The results of these analyses must be interpreted to reveal additional 

20 critical metrics of computational intensity, including for example but not restricted to 
average and peak MOPS for each relevant operation. This enables the requisite 
specifications for the hardware and software processing resources to be further 
evaluated. 

The second step of profiling involves analysis of conunonality of signal 
25 processing functions across the standards, applications, and/or services of interest. 
This is represented by example in Figure 11. Included in abridged spreadsheet 220 are 
representative standards/ applications, and respective relevant signal processing 
functions within the general category of parameter estimation. Figure 1 1 shows, for 
example, that a Windowed Average Energy Estimator is required by seven of the listed 
30 standards. The designer would research the respective requirements of each of these 
seven standards to determine the required superset and seven subsets of functionality. 
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The third profiling step* defining the data processing kemeis necessary to serve 
the standards, applications, and/or services of interest, is shown conceptually in Figure 
12 for a different set of standards. In general, each unique type of kernel includes a 
combination of variable and invariant functionality. The invariant component is 

5 determined by the above steps to be common across the standards, applications, and/or 
services of interest, while the variable functionality is that determined necessary to 
adapt to the various standards, applications, and/or services of interest. Each kernel is 
designed to include sufficient control and interface functionality to permit 
reconfiguration according to the end operational scenario. 

10 The intercormection of kemeis must also be determined from profiling as 

shown in the example abridged matrix 260 of Figure 13. The rows and colunms of 
matrix 260 show a representative set of hardware signal processing kemeis that have 
been defined according to the above profiling steps, along with all connections 
necessary to serve a representative set of CDMA-based wireless communication 

IS standards. Along the axes of matrix 260. signals generally flow fix)m bottom to top, or 
from right to left with exceptions as indicated. Each cell containing an "X" represents 
a required interface between the respective kemeis. It can be seen that in the vicinity 
of the diagonaU intercoimections are tightly clustered, as for example cluster 262. 
Other types of interconnections include parallel cormections, e.g., 264, and isolated 

20 connections, e.g., 266. Where common across all standards, applications, and/or 
services of interest, these interconnections are made directly, as represented by 
connection 164 of Figure 8. Conversely, connections that must change as a fimction of 
standard etc. mxxst be effected by the reconfigurable data router 168 of Figure 8. 

This analysis reveals certain considerations of interest to the implementation 

25 designer. For example, in matrix 260. cluster 262 indicates short connections in the 
silicon implementation, while cluster 266 indicates longer connections. Cluster 264 
indicates a set of parallel connections. 

To summarize, reconfiguration of multiprocessor 160 is effected by i) selection 
of hardware processing kernel types, ii) control of the variable kernel functionality, 

30 and iii) control of the reconfigurable data router 168. 

Once the kernel types and interconnections have been determined, the 
multiplicity of each kernel type needs to be determined, as illustrated in Figure 14. A 

14 
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kernel pool 280 includes a sufficient number of each type of kernel to permit the 
assembly of multiple datapaths 290. In turn, a sufficient multiplicity of datapaths 290 
is assembled to accommodate the signal processing requirements of a particular 
standard, service or application. This is illustrated for a number of representative 
5 applications and/or products 300. The portfolio 300A-D can represent either a single 
product having multi-mode/standard /application capability, or multiple, separate 
products based on common underlying hardware and software resources. 

Thus, a manufacturer can enjoy mass customization based on a common 
product "platform." Initial or subsequent configuration can be performed in the 
10 factory, at point-of-sale, or by the user after delivery. Post-deliver>' customization can 
be based upon any of a number of techniques, including but not limited to smart card, 
wired interface, and over-the-air/over-the-network download and billing. 

In Figure 14, the various kernels 162 are interconnected by the reconfigurable 
data router 1 68 of Figures 7-8. For simplicity, however, reconfigurable data router 
15 1 68 is not shown in Figure 1 4. 

For convenience, the above discussion has utilized examples relating to 
wireless communication systems. The principles discussed apply directly to other 
applications including but not limited to multimedia, security, and networking. 
Furthermore, the receive path only has been illustrated; the apparatus and method of 
20 the invention apply directly to the transmit path. 

Those skilled in the art will recognize a number of benefits associated with the 
disclosed architecture. The architecture provides the ability to reconfigure a single 
product platform for multiple standards, applications, services, and quality-of service, 
instead of developing multiple hardware platforms to establish the same collective 
25 functionality. The architecture also provides the ability to use software programming 
techniques to reduce product development time and achieve rapid and comprehensive 
product customization. Thus, new services can be provided via software upgrades. 
The apparatus of the invention allows a network operator or service provider to control 
a communication lenninal's capabilities. Equipment manufacturers may exploit the 
30 invention to create software-defined communication appliances. 

Advantageously, the architecture of the invention optimally combines fixed- 
function and reconfigurable logic resources. The system has reconfigurable control 
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and datapaths. The invention extends the performance efficiency of microprocessors 
and digital signal processors via the augmentation of data paths and control paths 
through a reconfigurabie co-processing machine. The reconfigurability of the data 
path optimizes the performance of the data flow in the algorithms implemented on the 

5 processor. 

The architecture efficiently redirects functions previously nmning on a Fixed 
Function Data Arithmetic Logic Unit to a more flexible Heterogeneous Reconfigurabie 
Multiprocessing Unit. The invention does not depend upon the fine-grained 
reconfigurability of existing programmable logic devices, and so does not have the 
10 problem of such devices of the area and power of the chip being dominated by the 
routing resources. Furthermore, the invention does not substantially rely on 
instruction-set programmable processors. Instead a quasi-fixed set of computational 
resources that span the signal processing requirements of the standards, applications, 
and/or services of interest are configured together in a reprogrammable manner. This 
1 5 architecture can be applied to, implement signal processing and/or control of 
processing applications. Ideal applications include but are not limited to digital 
communication, networking, encryption/security, and multimedia applications. 

The invention reduces product development time via the use of a 
reconfigurabie processor that is programmable via a high-level language. This avoids 
20 the problem of having to create new processors each time the algorithms considered 
for new applications change or push the performance envelope of existing processor 
architectures. The invention enables mass customization of relevant consumer and 
infrastructure products. 

The foregoing description, for purposes of explanation, used specific 
25 nomenclature to provide a thorough understanding of the invention. However, it will 
be apparent to one skilled in the art that the specific details are not required in order to 
practice the invention. In other instances, well-known circuits and devices are shown 
in block diagram form in order to avoid unnecessary distraction fi-om the underlying 
invention. Thus, the foregoing descriptions of specific embodiments of the present 
30 invention are presented for purposes of illustration and description. They are not 
intended to be exhaustive or to limit the invention to the precise forms disclosed; 
obviously many modifications and variations are possible in view of the above 

16 
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teachings. The embodiments were chosen and described in order to best explain the 
principles of the invention and its practical applications, the thereby enable others 
skilled in the art to best utilize the invention and various embodiments with various 
modifications as are suited to the particular use contemplated. It is intended that the 

5 scope of the invention be defined by the following claims and their equivalents. 
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1 . A digital wireless communication device, comprising: 
a radio-frequency (RF) subsystem: 

5 an analog-to*digitai converter, 

a software-programmable processor: 

a heterogeneous reconfigurable multiprocessing logic circuit: and 
a bus connecting said software*programmable processor and heterogeneous 
reconfigurable multiprocessing logic circuit. 

10 

2. The digital wireless communication device of claim 1 wherein the software- 
programmable processor is selected from the group comprising a digital signal 
processor and a central processing unit. 

!5 3. The digital wireless conmiunication device of claim 1 wherein said software- 
programmable processor executes a set of software modules specifying a set of 
wireless conmiunication applications, standards, and services to be implemented. 

4. The digital wireless communication device of claim I wherein said 
20 heterogeneous reconfigurable multiprocessing logic circuit comprises a plurality of 
programmable data processing kernels, each data processing kernel of said plurality of 
data processing kernels including a data sequencer, a local memor\\ and an arithmetic 
logic unit. 

25 5, The digital wireless communication device of claim 4 wherein said plurality of 
programmable data processing kernels are programmed to implement a set of wireless 
communication applications, standards, and services. 

6. A method of progranuning a digital wireless communication device to support 
30 a wireless communication application, standard, and service, said method comprising 
the steps of: 
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selecting a set of parameters and values required to implement a specified 
wireless communication application, standard, and service; 

downloading said set of parameters and values to a digital wireless 
communication device including a heterogeneous reconfigurable multiprocessing logic 
5 circuit with a plurality of re-programmable data processing kernels: and 

operating said plurality of re-programmable data processing kernels in 
accordance with said set of parameters and values to support said specified 
communication application, standard, and service. 



10 7. The method of claim 6 wherein said selecting step includes the step of 
selecting a set of parameters and values from the group comprising: a country of 
operation, a frequency band, and a data rate. 

8. The method of claim 6 wherein said downloading step includes the step of 
15 downloading said set of parameters over a wireless communication channel. 

9. The method of claim 6 wherein said downloading step includes the step of 
downloading said set of parameters at the time of manufactiu^. 

20 10. The method of claim 6 wherein said downloading step includes the step of 
downloading said set of parameters at a point of sale. 
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